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■^ ' Abstract 



We prove that if T is a tree on n vertices with maximum degree A and the edge probabihty p{n) 
satisfies: np > CmaxlAlognjn*^} for some positive e > 0, then with high probabihty the random 
graph G{n,p) contains a copy of T. The obtained bound on the edge probabihty is shown to be 
essentially tight for A — n®'^^). 



o 
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c^ ! 1 Introduction 



In this paper we consider the problem of embedding a copy of a given spanning tree T on n vertices 
CN ' into the binomial random graph G{n,p). 

\^ • Embedding problems are one of the most classical subjects in Extremal and Probabilistic Combi- 

(N' 

en 

CN ' given family, in random graphs. Here we concentrate on embedding large trees in binomial random 

C^ ' graphs. 

(^ ' The problem of embedding large or nearly spanning trees in random graphs on n vertices (where by 

a nearly spanning tree we mean a tree T whose number of vertices is at most (1 — c)n for some constant 
c > 0) is a rather well researched subject, especially in the case of trees with bounded maximum degree, 

r> I see, e.g., [7j, [1], [9], [8], [10]. In particular, Alon, Sudakov and the author proved in [2] that for given 

e > and integer d there exists C = C{d,e) > such that whju the random graph G{n,p) with 
p = C/n has a copy of a tree T on (1 — e)n vertices of maximum degree at most d (in fact |2j proved 
that such a random graph contains whp a copy of every such tree); better constant dependence and 
the resilience version of this result have recently been obtained in [3] and [1] , respectively. 

In contrast, nearly nothing has been known for the case of embedding spanning trees. Even the 
case of embedding spanning trees of bounded maximum degree appears to be unaddressed, apart from 
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An event £"„ occurs with high probability, or whp for brevity, in the probability space G{n,p) if limn-»oo P»"[G ~ 
G{n,p) £gn] = 1. 



some sporadic cases. Of course, the most classical result is about embedding a Hamilton path, or 
even a Hamilton cycle, in G{n,p); Komlos and Szemeredi [Hj and independently Bollobas [Sj proved 
that ii p{n) > -^ — ^^ , where io{l) is any function tending to infinity arbitrarily slowly with n, 

then whp G{n,p) contains a Hamilton cycle. Alon et al. [2] observed that if a tree T has a linear 
in n number of leaves, then ■whp G{n,Clnn/n) contains a copy of T for some large enough C > 0; 
the proof is not that hard and utilizes the embedding result for nearly spanning trees from the same 
paper. However, no general result for this problem has been obtained, and even the case of the comb 
(which is the path Pq of length ^/n — 1 with disjoint paths of length -y/n — 1 attached to each vertex 
of Pq), interpolating in some sense between the above mentioned solved cases, is open; this natural 
question has been communicated to us by Jeff Kahn [12] . 

Here we make a substantial step forward in solving this class of problems. Our main result if the 
following embedding theorem. 

Theorem 1 Let T be a tree on n vertices of maximum degree A. Let < e < 1 be a constant. Lf 

up > — Amn + n , 
e 

then whp a random graph G(n,p) contains a copy ofT. 

In other words, starting from A(T) = n"^, edge probability p = — — ^^ is enough to get whp a copy 
of T in G{n,p). 

It is not hard to see that the dependence of p on A, posted in Theorem [H is optimal up to a 
constant factor in the range np = n®^^'. In order to state this result formally, for integers n > A > 3 
define the tree T{n,A) as follows. Write n = (A — l)k — r, where < r < A — 2. Take a path 
P = (vi, . . . ,Vk) with k vertices, attach to vi, . . . ,Vk-i vertex disjoint stars with A — 2 leaves each, 
and finally attach to v^ a star with A — 2 — r leaves. The tree T{n, A) has n vertices and is of maximum 
degree at most A. For future reference observe that the k vertices of P dominate the remaining n — k 
vertices of T(n, A). 

Theorem 2 For every e > there exists 5 > such that if n'^ < A < j^, then a random graph 
G{n,p) with p = '^" whp does not contain a copy ofT{n,A). 

There is a certain similarity in appearances between the above results and the theorem of Komlos, 
Sarkozy and Szemeredi [13], who proved that for 6 > and all large enough n, any graph G on n 
vertices of minimum degree at least {l/2+S)n contains a copy of every tree T on n vertices of maximum 
degree A(r) < en/ Inn, where c = c{6) is a small enough constant; they noticed that their condition 
on A(r) is essentially tight too (actually because of the random graph G{n,p) with 1/2 < p < 1 and 
the above described tree T(n,A), just like in our Theorem [2]). The arguments of [13] are naturally 
very different and do not seem to have much bearing on the situation in (sparse) random graphs. 

In order to ease the reader's task we now give a brief description of the proof of Theorem [TJ The 
key definition used is that of a bare path: 



Definition 1 A path P in a tree T is called bare if all vertices of P have degree exactly two in T. 

In the proof of Theorem [H we first argue that every tree T on n vertices has a linear in n number 
of leaves, or a collection of vertex disjoint bare paths of (large) constant length each (Lemma I2.ip . 
The former case is rather easy; similarly to the argument outlined in [2], we first embed the subtree 
F of T, obtained by deleting from T a linear number of leaves, by a straightforward greedy algorithm 
(Lemma I2.2p . Then we embed the remaining edges between the omitted leaves of T and their fathers 
(Lemma 12. 3p : the restriction np > CAlnn is induced by this part. In the complementary case, where 
the number of leaves of T is relatively small, we first take out a linear number of disjoint constant 
length bare paths to obtain a subforest F of T; we embed F using the same greedy argument (Lemma 
12. 2|) . Then we are left with embedding the remaining bare paths; we do this by reducing the problem 
to that of finding a factor of cycles (with some extra conditions imposed) in a random graph, and 
then by invoking a beautiful result of Johansson, Kahn and Vu [11] about factors in random graphs 
(Lemma 12. 4p . In both above described cases we need our random edges to come in two independent 
chunks: the standard trick of representing G ~ G{n,p) as G = Gi U G2, where Gj ~ G{n,pi) and 

1 — p = {1 — pi){l — P2), allows for this readily. 

The notation used in the paper is pretty standard. We systematically suppress rounding signs for 
the sake of clarity of presentation. 

The proofs of Theorems [T] and [2] are given in the next section. The last section of the paper is 
devoted to concluding remarks. 

2 Proofs 

Lemma 2.1 Let k,l,n > be integers. Let T be a tree on n vertices with at most I leaves. Then T 
contains a collection of at least "~^ ^ '-^ vertex disjoint bare paths of length k each. 

Proof. Define 



Vi 


= {veV{T) 


d{v) = 1} , 


V2 


= {veV{T) 


d{v) = 2} , 


V3 


= {v£V{T) 


d{v) > 3} . 



Clearly Vi is the set of leaves of T and thus satisfies \Vi\ < I. We have: 



2n-2 = 2|^(r)| = ^ d{v) > 1^11+21^21+31^3 

v<^V(T) 



31 



2(|l^l| + |1^2| + |^3|) + (|^3|-|^l 
2n +1^31-1^11, 



implying l^sl < |Vi| - 2 < / - 2. 



T has IV1I + IV3I — 1 < 2/ — 3 internally disjoint paths connecting between the vertices of FiU V3, with 
all internal vertices of these paths being of degree two. In each such path, pick a largest collection of 
vertex disjoint subpaths of length k. This leaves at most k vertices of the path uncovered, so altogether 
the so formed collection of bare paths of length A; in T contains all but at most (|Vi| + IV3I) + (1^41 + 
I V3 1 — 1) A; < (2/ — 2) (A;+ 1) vertices, implying that the total number of paths of length k in the collection 

!/-2)(; 

k+1 



is at least ^ ,,'} as required. D 



Lemma 2.2 Let < a < 1 be a constant. Let F be a tree on (1 — a)n vertices of maximum degree A. 
If anp > 3A + 51nn, then whp a random graph G{n,p) contains a copy of F. 

Proof. Choose arbitrarily a root r of F and fix some search order vr, say BFS, on F starting from r. 
Let TT = {vi = r, . . . , Vm) with m = (1 — a)n. We will embed F in G ~ G{n,p) according to vr. Let (j) 
be the so constructed embedding. 

Suppose we are to embed the children of a current vertex Vi, 1 < i < m — 1, in G . Let Ui C [n] be 
the set of vertices already used for embedding, clearly \Ui\ < m. Expose the edges of G from (j){vi) to 
[n] — Ui. We need to find at most dpivi) < A neighbors of 0(fi) outside Ui. The probability of this 
not happening is at most 

(nnp-A)^ 2a„p 1 

Pr[Bin[n — m,p) < A] < e ^anp < g g ^ — . 

n 

Taking the union bound over all embedding steps, we conclude that whp G contains a copy of F. D 

Lemma 2.3 Let < di, . . . ,dk be integers satisfying: di < A, Yli=i d-i = I- Let A = {ai, . . . , a^}, B 
be disjoint sets of vertices with \B\ = I. Let G be a random bipartite graph with sides A and B, where 
each pair (a, h), a G ^4, 6 G B, is an edge of G with probability p, independently of other pairs. If 

2A In / 

then whp as /—)• oo the random graph G contains a collection Si, . . . ,Sk of vertex disjoint stars such 
that Si is centered at Oj and has the remaining di vertices in B. 

Proof. Define an auxiliary (random) bipartite graph G' with sides A' and B, where \A'\ = \B\ = I. 
The vertices of A' are partitioned into k pairwise disjoint sets Ai, . . . ,Ak with |Aj| = dj, 1 < i < A:. 
G has an edge between a' G Ai and b £ B with probability pi, where (1 — pi) "■ = 1 — p, implying 
Pi > p/di > p/A. The distribution G' induces the distribution of G by the obvious projection: G has 
an edge between a £ A and b € B iS G' has some edge between Ai and B. Observe that if G' has a 
perfect matching M' then G has the desired collection of stars {Si}, obtained by projecting A' back 
into A (the vertices of Ai are projected onto Oj). 

By the classical results about random graphs (see, e.g.. Section 7.3 of [6]) and the monotonicity of 
the property of having a perfect matching it is enough to require that all individual edge probabilities 
in G' are at least (In/ + uj{l))/l. Recalling that pi > p/A, we see that the lemma's assumption 
p > •^ , guarantees the required condition. D 



Lemma 2.4 Let k > 3 be a fixed integer. Let G be distributed as G{{k+l)nQ,p). Let S = {si, . . . , Sng}, 
T = {ti, . . . ,tn(,} be disjoint vertex subsets of [{k + l)no]. // 

(lnno\ 

for some large enough constant C = C{k), then whp G contains a family {Pi\^li of vertex disjoint 
paths, where Pi is a path of length k connecting Si and ti . 

Proof. Fix a partition of y(G) — S'UT into vertex disjoint subsets Vi, ..., Vfc_i of cardinality \Vi\ = uq 
each. Define an auxiliary graph H with vertex set V{H) = XUViU. . .UV^-i, where X = {xi, . . . , x„,q}. 
For 1 < i < k — 2, the edges of H between Vi and T^+i are identical to those of G. For v & Vi and 
Xj £ X, {v,Xj) is an edge of H iff {v,Sj) is an edge of G. Similarly, for v € V^-i and Xj £ X, {v,Xj) 
is an edge of H iff {v,tj) is an edge of G. Notice that each relevant pair in V{H) becomes an edge of 
H independently and with probability p. 

Suppose now that H contains a Cfc-factor {^i, . . . , S„o}, where each cycle 5^ traverses the sets 
X,Vi, . . . , Vfc-i in this order. Each such cycle Si translates to a path of length k between Si and ti in 
G, and these paths are pairwise disjoint. 

It thus remains to argue that the random graph H contains whp the desired collection of cycles. 
This can be obtained from the result of Johansson, Kahn and Vu [11] through straightforward (but 
quite tedious) modification of their arguments. (They proved that a random graph G{kn,p) with 
p > C{k) ( fc"i ) contains whp a factor of cycles Cfc, we need the factor in a k-partite random 
graph; moreover, the cycles in the factor are required to traverse the parts in the prescribed order.) D 

Proof of Theorem [B Set 
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We consider two cases. 

Case 1. T has at least 6n leaves. 

We represent G as the union E{G) = E{Gi) U E{G2), where Gi, G2 are two independent random 

graphs, both distributed according to G{n,p') with 1 — p = (1 — p'Y (and thus p' > p/2). Let -F be a 

subtree of T obtained by deleting from T an arbitrary set of 6n leaves. We first find a copy (j){F) of F 

in Gi - such a copy exists whp due to Lemma \T2[ Let now B = [n] — V{(p{F)), and let A C F(0(F)) 

be the set of images of the fathers of the 5n leaves deleted from T to form F. Denote A = {ai, . . . , a^}, 

and let dj < A be the number of leaves in T connected to the preimage (j)~^{ai) and left outside F; 

clearly ^i^i di = 5n. In order to complete the embedding of T into G, we need to find in G k vertex 

disjoint stars Si, ... ,3^, where the star Si is centered in Oj and has the remaining di vertices in B. We 



invoke Lemma 12.31 to find such stars whp using the (random) edges of G2 between A and B. Since 
the edge probability in G2 is at least p/2, we need to verify that 

p 2Aln{6n) 
2 - 6n ■ 

Recalling that S = jq and p > ^^ m"" ' ^^ ^^^ ^^^* *^^^ condition is fulfilled indeed. 

Case 2. T has less than 6n leaves. 

We again represent G as the union E{G) = E{Gi) U E{G2) as in the previous case. According to 

Lemma [271} T contains a family of no = '^^ — T+i — 0(^) of vertex disjoint bare paths of length 

k. Let F be a subforest of T obtained by deleting the internal vertices of such a family of bare paths. 

We first use the edges of Gi to find whp a copy 4>{F) of F; this is possible again due to Lemma [2?2l 

It now remains to insert these ng bare paths, connecting between prescribed pairs of vertices. We can 

apply Lemma [23] to the edges of G2 to meet this goal. Since the edge probability in G2 is at least p/2 

and 

p f^-i+e / Inn ^ ' 
> — ^ — » 



2 - 2 V™^"\ 

(recall e > l/fc), the graph G2 contains indeed the required collection of paths whp. The proof is 
complete. D 

Proof of Theorem [2l Set 

n 



A-1 

Consider the random graph G{n,p) with p = ^^, the value of (5 = (5(e) to be chosen later. Recall 

that in the tree T{n, A) k vertices of the spine path P dominate the rest of the graph. It thus suffices 
to show that whp G{n,p) has no dominating set of size k. The probability that such a dominating 
set exists is at most 



M (^ _ (^ _ p^^k^n-k < (^^y ^-in-m-p)^ 



< (3A)a e-z'^ < e 



pfe 2TilnA n-*- ^ 



Taking 6 = e/2 we see that whp the random graph G{n,p) does not contain a dominating set of size 
k and thus whp does not contain a copy of T{n, A). D 

3 Concluding remarks 

We have shown that the (pretty immediate) lower bound on the edge probability p{n) > cA(T) In n/n 
for the random graph G{n,p) to contain whp a copy of a given spanning tree T of maximum degree 
A is tight up to a constant factor in the range A = n '^^ 



The regime A(T) = n°^^' stays largely open. In particular, we were not able to provide a satisfac- 
tory solution for the most natural case of embedding spanning trees with bounded maximum degree. 
Our result only shows that in this case is is enough to require p{n) = n~^^°^^'; this is probably not 
the tightest bound possible. 

For the case of embedding a bounded degree spanning tree T with en leaves [2] has shown that it 
is enough to take p{n) = C In n/n, where C may depend on c. It is unclear whether such a dependence 
is necessary. It seems plausible that assuming p{n) = (1 + o(l)) In n/n may be enough. 

Finally, it would be very interesting to obtain sufficient conditions for embedding spanning trees 
with given maximum degree applicable to pseudo-random graphs. 
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